Nile's Distributed Computing Site Architecture
نویسندگان
چکیده
The goal of the Nile project is to manage distributed computing resources, making an arbitrarily large cluster of commodity computers appear to the user as a seamless uniprocessor environment. This paper describes Nile's software architecture. 1 The Nile Project The Nile project 2,4] is building a distributed computing solution for CLEO III 3]. The goal is to provide a self-managing, fault-tolerant, heterogeneous system of hundreds of commodity workstations, with access to a distributed database in excess of 100 TB. Nile will allow any resource to be used transparently by any member of the collaboration, from anywhere within the collaboration. Nile must outlive its development phase, adapt to and scale with changes in CLEO's computing needs, be easily maintained, and be able to incorporate new software components as they become available; this led us to a distributed architecture based on CORBA. To program replicated objects we are using Electra 5], an ORB built on top of a reliable communication substrate; currently, Electra has adaptors for Isis 1] and Horus 7]. 2 Nile Architecture In Nile's software architecture, each of CLEO's collaborating institutions is a site, with a local set of users, and computational and data resources that process jobs submitted by local users or other Nile sites. The preference is to use local resources only, but the great disparity in sites' resources will require
منابع مشابه
Experience with Distributed Replicated Objects: The Nile Project
1 The Nile Project The Nile project is developing a distributed computing solution for the CLEO High Energy Physics experiment 3]. The goal is to provide a self-managing, fault-tolerant, heterogeneous system of hundreds of commodity workstations, with access to a distributed database in excess of 100 TB. These resources are spread across the United States and Canada at 24 collaborating institut...
متن کاملUse of Network Modeling Tools in the Nile System Design
I discuss the relevance of network modeling tools to high energy physics experiments, particularly for understanding data ows and identifying possible bottlenecks during data analysis. The Nile collaboration is using these tools to model a fault-tolerant computing system distributed over geographically dispersed sites. I summarize Nile's evaluation of three commericial tools and our initial exp...
متن کاملA Review on Software Engineering Methods for Distributed Systems
Today we collect data from different size of data, different locations and different type with a large scale in each site. Current computer server systems cannot process and collect these big data. For this issue, distributed computing system proposed in the literature. Supercomputers are changed to distribute computing, such as cloud computing systems. Software engineering is a part of each so...
متن کاملAn Architecture for Security and Protection of Big Data
The issue of online privacy and security is a challenging subject, as it concerns the privacy of data that are increasingly more accessible via the internet. In other words, people who intend to access the private information of other users can do so more efficiently over the internet. This study is an attempt to address the privacy issue of distributed big data in the context of cloud computin...
متن کاملDistributed Query Scheduling Service: An Architecture and Its Implementation
We present the systematic design and development of a distributed query scheduling service (DQS) in the context of DIOM, a distributed and interoperable query mediation system [26]. DQS consists of an extensible architecture for distributed query processing, a three-phase optimization algorithm for generating e cient query execution schedules, and a prototype implementation. Functionally, two i...
متن کامل